SEQUENCE LISTING 



<110> Tchaga, Grigory S. 
Jokhadze, George 

<120> Metal Ion Affinity Tags and Methods for 
Using the Same 



<130> CLON-056CIP 

<140> US 09/858,332 
<141> 2001-05-15 

<150> 09/404,017 
<151> 1999-09-23 

<150> 60/101,867 
<151> 1998-09-25 

<160> 34 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 1 

His Leu Ile His Asn Val His Lys Glu Glu His Ala His Ala His Asn
1 5 10 15



<210> 2 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 2 

His Asp Asp His Asp Asp His Asp Asp His Asp Asp His Asp Asp His

1 5 10 15

Asp Asp 



<210> 3 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 



<400> 3 



His Glu Glu His Glu Glu His Glu Glu His Glu Glu His Glu Glu His 

1 5 10 15

Glu Glu



<210> 4 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 4 

His Asp Glu His Asp Glu His Glu Asn His Glu Asn His Glu Asp His 

1 5 10 15

Glu Asp 



<210> 5 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 5 

His Glu Asp His Glu Asp His Glu Asp His Glu Asp His Glu Asp His 

1 5 10 15

Glu Asp 



<210> 6 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 6 

Asp Asp Asp Asp Lys 
1 5 



<210> 7 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> enterokinase cleavage site 
<400> 7 

Ile Glu Gly Arg
1 






<210> 8 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> a factor Xa cleavage site 
<400> 8 

Leu Val Pro Arg Gly Ser 
1 5 



<210> 9 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> a thrombin cleavage site 
<400> 9 

His Pro Phe His Leu Val Ile His
1 5 



<210> 10 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> a renin cleavage site 
<400> 10 

Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 10



<210> 11 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an immunological tag 
<400> 11 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 12 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an immunological tag 
<400> 12 

Cys Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
1 5 10



<210> 13
<211> 3426 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> DNA sequence of vector containing cDNA of 
recombinant enterokinase 



<400> 13 

gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 60

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 120

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 180

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt 240

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg 300

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga 360

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc 420

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac 480

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg 540

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca 600

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg 660

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg 720

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg 780

gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag 840

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg 900

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct 960

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac 1020

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact 1080

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga 1140

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagggt 1200

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 1260

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 1320

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc 1380

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 1440

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 1500

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 1560

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 1620

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 1680

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 1740

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 1800

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1860

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1920

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1980

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc 2040

cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca 2100

acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc 2160

cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg 2220

accatgatta cgccaagctt gaaggatcat ctcatccaca atgtccacaa agaggagcac 2280

gctcatgccc acaacaagat cgatattgtc ggaggaagtg actccagaga aggagcctgg 2340

ccttgggtcg ttgctctgta tttcgacgat caacaggtct gcggagcttc tctggtgagc 2400

agggattggc tggtgtcggc cgcccactgc gtgtacggga gaaatatgga gccgtctaag 2460

tggaaagcag tgctaggcct gcatatggca tcaaatctga cttctcctca gatagaaact 2520

aggttgattg accaaattgt cataaaccca cactacaata aacggagaaa gaacaatgac 2580

attgccatga tgcatcttga aatgaaagtg aactacacag attatataca gcctatttgt 2640

ttaccagaag aaaatcaagt ttttccccca ggaagaattt gttctattgc tggctggggg 2700

gcacttatat atcaaggttc tactgcagac gtactgcaag aagctgacgt tccccttcta 2760

tcaaatgaga aatgtcaaca acagatgcca gaatataaca ttacggaaaa tatggtgtgt 2820

gcaggctatg aagcaggagg ggtagattct tgtcaggggg attcaggcgg accactcatg 2880

tgccaagaaa acaacagatg gctcctggct ggcgtgacgt catttggata tcaatgtgca 2940



ctgcctaatc gcccaggggt gtatgcccgg gtcccaaggt tcacagagtg gatacaaagt 3000 

tttctacatg agctcgtaat tagctgagaa ttcactggcc gtcgttttac aacgtcgtga 3060 

ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 3120

ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 3180 

tggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 3240 

catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 3300 

cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 3360 

acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 3420 



acgcgc 3426



<210> 14 
<211> 269 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> protein sequence of vector containing cDNA of 
recombinant enterokinase 



<400> 14 



Met Thr Met Ile Thr Pro Ser Leu Lys Asp His Leu Ile His Asn Val
1 5 10 15

His Lys Glu Glu His Ala His Ala His Asn Lys Ile Asp Ile Val Gly
20 25 30

Gly Ser Asp Ser Arg Glu Gly Ala Trp Pro Trp Val Val Ala Leu Tyr
35 40 45

Phe Asp Asp Gln Gln Val Cys Gly Ala Ser Leu Val Ser Arg Asp Trp
50 55 60

Leu Val Ser Ala Ala His Cys Val Tyr Gly Arg Asn Met Glu Pro Ser
65 70 75 80

Lys Trp Lys Ala Val Leu Gly Leu His Met Ala Ser Asn Leu Thr Ser
85 90 95

Pro Gln Ile Glu Thr Arg Leu Ile Asp Gln Ile Val Ile Asn Pro His
100 105 110

Tyr Asn Lys Arg Arg Lys Asn Asn Asp Ile Ala Met Met His Leu Glu
115 120 125

Met Lys Val Asn Tyr Thr Asp Tyr Ile Gln Pro Ile Cys Leu Pro Glu
130 135 140

Glu Asn Gln Val Phe Pro Pro Gly Arg Ile Cys Ser Ile Ala Gly Trp
145 150 155 160

Gly Ala Leu Ile Tyr Gln Gly Ser Thr Ala Asp Val Leu Gln Glu Ala
165 170 175

Asp Val Pro Leu Leu Ser Asn Glu Lys Cys Gln Gln Gln Met Pro Glu
180 185 190

Tyr Asn Ile Thr Glu Asn Met Val Cys Ala Gly Tyr Glu Ala Gly Gly
195 200 205

Val Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Met Cys Gln Glu
210 215 220

Asn Asn Arg Trp Leu Leu Ala Gly Val Thr Ser Phe Gly Tyr Gln Cys
225 230 235 240

Ala Leu Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Pro Arg Phe Thr
245 250 255

Glu Trp Ile Gln Ser Phe Leu His Glu Leu Val Ile Ser
260 265

















<210> 15 
<211> 12 
<212> PRT 

<213> Artificial Sequence 



<220> 



<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 15 

His Asn His Asn His Asn His Asn His Asn His Asn 
1 5 10



<210> 16 

<211> 48 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 16 

catctcatcc acaatgtcca caaagaggag cacgctcatg cccacaac 48 

<210> 17 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 17 

cataaccata accataacca taaccataac cataac 36 

<210> 18 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 18 

catgatgatc atgatgatca tgatgatcat gatgatcatg atgatcatga tgat 54 

<210> 19 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 19 

catgaggagc atgaggagca tgaggagcat gaggagcatg aggagcatga ggag 54 

<210> 20 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> a DNA sequence embodiment of the affinity 
purification site 



<400> 20 

catgatgagc atgatgagca tgagaaccat gagaaccatg aggatcatga ggat 54

<210> 21 
<211> 9 
<212> PRT 
<213> Human 

<220> 

<221> VARIANT 
<222> 9 

<223> Xaa at position 9 is an amino acid with an 
aliphatic or amide side chain. 

<220> 

<221> VARIANT 
<222> 2 

<223> Xaa at position 2 is an amino acid with an
aliphatic or amide side chain.

<220> 

<221> VARIANT 
<222> 3 

<223> Xaa at position 3 is an amino acid with an 
aliphatic or amide side chain. 



<220> 

<221> VARIANT 
<222> 5 

<223> Xaa at position 5 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain. 

<220> 

<221> VARIANT 
<222> 6 

<223> Xaa at position 6 is an amino acid with a basic
side chain (except HIS) or an acidic side chain.

<220> 

<221> VARIANT 
<222> 7 

<223> Xaa at position 7 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain. 



<400> 21 

His Xaa Xaa His Xaa Xaa Xaa His Xaa 
1 5 



<210> 22 
<211> 6 
<212> PRT 
<213> Human 

<220> 



<400> 22 

His Arg His Arg His Arg 
1 5 



<210> 23 
<211> 9 
<212> PRT 
<213> Human 

<220> 

<221> VARIANT 
<222> 2 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 3 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 5 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 6 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 8 

<223> Xaa = an amino acid having an acidic side chain 
<220> 

<221> VARIANT 
<222> 9 

<223> Xaa = an amino acid having an acidic side chain 
<400> 23 

His Xaa Xaa His Xaa Xaa His Xaa Xaa 
1 5 



<210> 24 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Protease cleavage site 
<400> 24 

Asp Asp Asp Asp Lys 
1 5 



<210> 25 

<211> 21 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 




<400> 25 

Leu Pro Pro Leu Ser Glu Leu Ile Pro Leu Ala Ala Ala Glu Arg Pro

1 5 10 15

Ser Ala Ala Ser Gln
20 



<210> 26 
<211> 38 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 26 

Ala Arg Lys Arg Lys Ser Ala Gln Tyr Ala Asn Arg Leu Ser Pro Arg

1 5 10 15

Val Gly Arg Phe Ile Asn Ala Ala Gly Thr Thr Gly Phe Pro Thr Gly

20 25 30 

Lys Arg Ala Val Ser Ala 
35 



<210> 27 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 27 

Glu Phe Thr Gly Arg Arg Phe Thr Thr Ser 
1 5 10



<210> 28 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 28 

His Asn His Asn His Asn 
1 5 



<210> 29 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 29 






His Asn His Asn His Asn His Asn 
1 5



<210> 30 

<211> 10 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 30 

His Asn His Asn His Asn His Asn His Asn 
1 5 10



<210> 31 

<211> 14 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 31 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn 
1 5 10



<210> 32 

<211> 16 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 32 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn His Asn 
1 5 10 15



<210> 33 

<211> 18 

<212> PRT 

<213> Artificial Sequence



<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 33 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn His Asn 

1 5 10 15

His Asn 






<210> 34

<211> 20 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 34 

His Asn His Asn His Asn His Asn His Asn His Asn His Asn His Asn 

1 5 10 15

His Asn His Asn 
20 






